Improving part-of-speech tagging in Amharic language using deep neural network

نویسندگان

چکیده

To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, task tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated models such as bidirectional long short term memory, convolutional neural network combination with and conditional random field Amharic tagging. Various features, both language-dependent -independent, explored model. Besides, word-level character-level features are analyzed deep models. A is utilized encoding at word character level. Each model's performance has on dataset that contained 321 K tokens manually tagged 31 tags. Lastly, best obtained by an end-to-end model, memory field, 97.23% accuracy. This highest accuracy competent contemporary currently existing

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Methods for Amharic Part-of-Speech Tagging

The paper describes a set of experiments involving the application of three state-ofthe-art part-of-speech taggers to Ethiopian Amharic, using three different tagsets. The taggers showed worse performance than previously reported results for English, in particular having problems with unknown words. The best results were obtained using a Maximum Entropy approach, while HMM-based and SVMbased ta...

متن کامل

Part of Speech Tagging for Amharic using Conditional Random Fields

We applied Conditional Random Fields (CRFs) to the tasks of Amharic word segmentation and POS tagging using a small annotated corpus of 1000 words. Given the size of the data and the large number of unknown words in the test corpus (80%), an accuracy of 84% for Amharic word segmentation and 74% for POS tagging is encouraging, indicating the applicability of CRFs for a morphologically complex la...

متن کامل

A Neural Network Approach to Part-of-Speech Tagging*

Neural networks are one of the most efficient techniques for learning from scarce data. This property is very useful when trying to build a part-of-speech tagger. Available part-of-speech taggers need huge amounts of hand tagged text, but for Portuguese there is no such corpora available. In this paper we propose a neural network that, apparently, is capable of overcoming the huge training corp...

متن کامل

Neural Network Approach to Thai Part Of Speech Tagging

Thai part of speech (POS) tagging is a challenged problem in natural language processing. Many techniques including artificial neural network techniques are suggested for POS tagging. Research works in Thai POS tagging so far only focused on assigning word types, but not word features. This paper proposed a technique using multilayer perception for tagging word features in Thai sentences. The f...

متن کامل

Amharic Part-of-Speech Tagger for Factored Language Modeling

This paper presents Amharic part of speech taggers developed for factored language modeling. Hidden Markov Model (HMM) and Support Vector Machine (SVM) based taggers have been trained using the TnT and SVMTool. The overall accuracy of the best performing TnTand SVM-based taggers is 82.99% and 85.50%, respectively. Generally, with respect to accuracy SVM-based taggers perform better than TnTbase...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Heliyon

سال: 2023

ISSN: ['2405-8440']

DOI: https://doi.org/10.1016/j.heliyon.2023.e17175